Computer simulations of language change notes

This website collects my personal notes on Computer simulations of language change. These notes are provided to bring full transparency to my research process. Of course, since they are only notes, they do not reflect my final thoughts on a topic, and should not be interpreted as such. To read finished papers, please consult my website. Do not use these notes as a basis for your own scientific research. Start from high-quality, peer-reviewed scientific literature instead.

03. The Schelling Chapter

p. 53

this chapter

segregation
model by Thomas Schelling
one of the most influential models of human social behaviour

further

how to turn an idea into a model
then: how to analyse that model
paying particular attention to assumptions
- decomposition and its consequences

The puzzle of segregation

segregation

individuals of different races or ethnicities tend to be geospatially clustered

p. 54

Map of racial and ethnic divisions in the New York City metropolitan area

Red is White
Blue is Black
Green is Asian
Orange is Hispanic
Yellow is Other
Each dot is 25 residents. Data is from the 2010 U.S. Census.

Possible explanations for segregation

1. organised action

only certain classes of individual are permitted to live in certain neighbourhoods

p. 55

2. socioeconomic filters

race or ethnicity, once associated with different socioeconomic classes, retain those associations
structural differences in opportunities and incentives for advancement and mobility

3. individual preferences / homophily

individuals may choose to live near others who are similar

↳ why take individual preferences into account?

1. individual preferences can have an effect on population structure
2. (more generally) relationship between individual preferences and population-level patterns are interesting

↓

↳ Schelling model

how do homophilic inclinations translate to segregated spatial distributions?
in absence of other factors, to what extent do individual preferences to live near similar neighbours influence the population-level pattern of segregation?

A model of segregation

Requirements

model as a drastic simplification

capturing the essence of segregation

individuals

home location + group identity (race)
with preferences of proportion of neighbours they can tolerate belonging to a different group
if their neighbourhood falls below their preference threshold, they move

A funny thing about building models is that a model almost always contains models within it. A model of segregation needs a model of geographical space and models of individuals’ ethnicity and behaviour. It’s models all the way down.

group identities

‘at least two’

other facilities

a minimal model of geographical space
a mechanism for changing location

p. 56

Spatial properties

defining neighbourhoods

requires an explicit spatial structure
here: discrete square grid / square lattice

The grid is discrete rather than continuous (as it was in the Particle World model) because dwellings in most urban areas are discrete. Homes tend to have fixed locations, and you cannot move your home arbitrarily close to another.

p. 57

A square lattice

p. 56

Important properties of a square grid

1. intuitive to visualise

simple to identify a neighbourhood, its race and its neighbours (and their races)

2. symmetrical

all agents will have an equal number of neighbours
(assuming infinite size)

boundaries and boundary effects

agents along boundaries have fewer neighbours → effects
to avoid this: toroidal boundaries

3. no effects of preferred axis of orientation

not one area is more desirable or has more/less capacity of residency

4. easy to code

it’s simple

Properties a square grid ignores

1. features common to real cities influencing desirability

density of housing
natural features (parks, rivers)

p. 56-57

You might find yourself a bit uncomfortable with this oversimplification. This discomfort is healthy— hold on to it. It is important to keep in mind the simplifying assumptions one makes when modelling, because the results of our simulations are direct consequences of those assumptions. […] Nevertheless, we must always do violence to reality to make any sense of our world, and we have to start somewhere.

p. 57

Agent properties

agent location

one cell in the lattice

agent ethnicity

‘real’ ethnicity not needed
i.e. one of two colours (easy for viz)

↳ problems

1. not all identities are clear-cut and observable
2. one can have multiple identities at once

Because this is a simple baseline model, this is all OK.

p. 58

agent behaviour

stay if the fraction of similar neighbours is sufficiently high, and move otherwise

It is worth noting that other formal decision rules consistent with the idea that “individuals prefer to live near similar neighbors” are possible, and that using these may somewhat change the model outcomes (Bruch and Mare, 2006).

‘neighbourhood’

several ways to operationalise a neighbourhood

Types of neighbourhoods
Moore neighbourhood	Von Neumann Neighbourhood

eight closest cells	four closest cells
includes diagonals	cardinal directions only

↳ larger neighbourhoods

larger squares, more area
when you want agents to assess or otherwise interact with spatial neighbours on a square lattice

Neighbourhood extensions
	r = 1	r = 2	r = 3
Moore neighbourhood
Von Neumann neighbourhood

In bounded finite spaces, the neighbourhoods will become truncated, leaving agents near the edges of the space with fewer neighbours than those nearer the middle. Toroidal neighbourhoods fix this issue again.

Dynamics

How does the world change?

time step core loop

each time step, every agent takes one of two actions
1. do nothing and remain where it is
2. move

neighbourhood calculation

other agents can be perceived, and neighbourhood proportion can be calculated

An agent doesn’t necessarily mind being a minority in its neighbourhood, but it wants to have at least some similar neighbours.

p. 59

⇒ If the proportion of my neighbours who are the same colour as I am is below my tolerance threshold, move. Otherwise, stay put.

‘moving’

search at random for a new empty cell which satisfies the conditions
happens until equilibrium is reached

↳ equilibrium

stable change that will not change further if the system is not perturbed

Outcomes

What do we want to learn from this model?

once equilibrium

how segregated is the population?
⇒ measure of segregation

↓

1. average similarity

have each agent count the proportion of its neighbours that are the same colour as itself
averaged over all agents
for no segregation: should be 0.5

2. other measures TODO exploration

p. 64

A formal description of the model

Explaining simple ⟷ complex models
simple models	complex models
can be fully described in just a few paragraphs	may require you to partition your model description ↓ important computations in the main text, specific implementation in the appendix

Partitioning your model description is advised for readability

variables

i.e. grid with 𝐿 𝑥 𝐿 size (rather than specifying the size)
makes model description general
also reminds you of choices you make for parameter vlaues

Initialisation

Consider an 𝐿 × 𝐿 square grid with toroidal boundaries. At initialization, one agent is placed upon each cell with a probability p, which characterizes the population density relative to the available space (0 < 𝑝 < 1, so that each agent can relocate to an empty location). The expected population size is therefore 𝑁 = 𝑝𝐿². The population is divided into 𝐺 groups, such that each agent 𝑖 is randomly assigned a fixed group identity 𝑔_𝑖 ∈ {1, 2, . . . , 𝐺}, each chosen with equal probability. For all of our analyses, we will use 𝐿 = 51 and 𝐺 = 2. The population is also defined by a similarity threshold, 𝑆, which defines the minimum proportion of an agent’s neighbors that must be similar for it to refrain from moving. Note that the use of probabilistic assignments means that, even holding 𝑝 and 𝐺 constant, there will be some variation between simulation runs in terms of exactly how large the population is and how many agents belong to each group. This stochasticity is often seen as a positive because it allows us to assess how robust the model is to minor fluctuations. However, one could also impose stricter requirements, so that, for example, the population size was always the nearest integer value of 𝑝𝐿².

p. 62-63

Dynamics

At any given time, each agent is either “happy,” in which case its neighbors are sufficiently similar and it will not move, or “unhappy,” in which case it will move because its neighborhood does not contain enough similar neighbors.

At the beginning of each time step, each agent 𝑖 first determines whether or not it is happy. The agent considers the other agents in its neighborhood (defined as a Moore neighborhood with radius r =1) and determines the proportion of its neighbors that share its group identity, 𝑠_𝑖. If 𝑠_𝑖 < 𝑆, the agent is unhappy, otherwise it is happy. After each agent has made this assessment, each unhappy agent, in random order, moves to a random empty cell (happy agents remain in their current locations). These dynamics repeat until no agents are unhappy. And they all lived happily ever after.

p. 62

Thinking about consequences

possible dynamics

think about what your model could do

cascade effects

movement of one agent sets in motion an entire reshuffling

p. 63

Coding the model

p. 68

The power of play

p. 69

play

good for getting to know your model
understanding how the dynamics arise

Schelling conclusions

even when everyone is content to be in the minority, most agents end up in highly segregated neighbourhoods in which they are the majority

p. 70

Model analysis

model analysis

quantifying a model’s behaviour more precisely

Batch, please

models based on equations	models using agents
deterministic	stochastic
mathematically tractable	mathematically untractable
we precisely can compute how model parameters relate to outcomes	we cannot compute how model parameters relate to outcomes
less realistic	more realistic

↓ how to report on stochastic outcomes?

↳ batch run / batch

generate representative distributions of outcomes possible under each set of parameters being investigated
then: descriptive statistics of model outcomes across simulations

# of runs required?

depends
if model is very stochastic, you need more runs

how long to run a simulation?

until natural stopping condition
fixed number of steps

p. 71

Parameter sweeps

different parameters,
different outcomes

systematically vary the parameter values
then record: how does the system respond?

dynamic range of parameters

within ‘reasonable ranges’

We should use parameters that are meaningful or realistic.

consider arbitrary / implicit assumptions

not all decisions in your model may be as straight-forward

Implicit assumptions of the Schelling model

There are only two groups in the population.
Each group is roughly the same size.
The quality of a neighborhood is completely determined by the trait makeup of an individual’s neighbors on the nearest eight patches.
There are no costs to moving.
Decisions to move are all-or-none based on a threshold.
All agents have identical thresholds for similarity.

↳ problem with assumptions

it is difficult to consider the consequences of every alternative assumption
will lead you much too far

curse of dimensionality

the number of simulations needed increases exponentially with the number of parameters

p. 72

Playing with your model can help you realize which parameters are probably important to explore and at which range of values, as well as which parameters you can more safely ignore, or run for only a couple of values to ensure robustness.

Null models

null conditions

eliminate mechanisms hypothesised to generate the main outcomes of your model
serve as a baseline to illustrate importance of a certain influence

Where are the stats?

inferential statistics?

regressions, analyses of variance
👁 Chapter 10 for why no inferential statistics

p. 73

Analysing the segregation model

BehaviorSpace

To what extent do individual preferences for living among similar neighbors yield segregated communities?

↳ modus operandi

vary thresholds, then check the effect (and quantitative measure)

here

model capped at 100 steps
usually 100 steps is enough to catch an equilibrium

how many runs?

here: 100 for each combination

p. 76

Results

qualitative results

1. as similarity-threshold increases, we get more segregation
2. average percent of similar neighbours is always considerably higher than the minimum acceptable percent of similar neighbours that agents will tolerate
⇒ if agents move when their preferences are not met, even weak individual preferences can generate strong patterns of segregation at the population level 3. the effect of population density is quite large

At lower densities, the population becomes much more segregated than it does at high densities. This is partly because at lower densities, individuals are likely to have only one or two neighbors, making the threshold harder to reach. Imagine you are in an isolated cluster of three agents. If you have only two neighbors, the only way to have at least 30% of them be the same color as yourself is for at least one of them to be that color. But if the remaining agent is a different color, then 100% of their neighbors are a different color, and so they will move.

tipping point events

events which change the whole entire equilibrium of the model
in this case, much more common at lower densities

p. 75

p. 76

Loner behaviour

depressed loners?

changing behaviour so agents don’t want to be alone
more noisy patterns
slight decrease in segregation

Being a loner should be fairly common under very low densities. If loners are always unhappy, they are more likely to end up in mixed neighborhoods, thereby decreasing overall segregation.

Notice that something funny seems to be happening for the lowest density and highest similarity threshold condition, as seen in the upper right of the rightmost plot in Figure 3.10. Average segregation dips down as a result of greater variation in outcomes. Playing around with individual runs shows that this is due to longer times needed to reach equilibrium for the lonely loners, often greater than the 100 time steps we allotted. The result is that the simulation ends before the population has reached its peak level of segregation. The spatial patterns of segregation that emerge at low densities also differ between the two conditions, as depicted in Figure 3.11.

This figure illustrates both the importance of examining the effects of seemingly minor assumptions, as well as the ways in which summary statistics can be limited in describing the true patterns present in a model system (or, indeed, in a real-world system).

p. 77

Reflections

high abstraction

you might feel uneasy because it might seem ‘too simple’
however: you will grow accustomed to this

p. 79

Going deeper

p. 80

Exploration

Computer simulations of language change notes